out-of-support region
Country:
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.14)
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)
- (10 more...)
Industry:
- Marketing (0.46)
- Information Technology (0.46)
Technology:
Offline Model-based Adaptable Policy Learning Xiong-Hui Chen 1, Y ang Y u
In reinforcement learning, a promising direction to avoid online trial-and-error costs is learning from an offline dataset. Current offline reinforcement learning methods commonly learn in the policy space constrained to in-support regions by the offline dataset, in order to ensure the robustness of the outcome policies.
Country:
- Asia > China > Jiangsu Province > Nanjing (0.04)
- Africa > Ethiopia > Addis Ababa > Addis Ababa (0.04)
- South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
- (10 more...)
Industry:
- Marketing (0.46)
- Information Technology (0.46)
Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)